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We introduce a generalisation of the well-known ARCH process, widely used for 
generating uncorrelated stochastic time series with long-term non-Gaussian distributions and 
long-lasting correlations in the (instantaneous) standard deviation exhibiting a clustering 
profile. Specifically, inspired by the fact that in a variety of systems impacting events are 
hardly forgot, we split the process into two different regimes: a first one for regular periods 
where the average volatility of the fluctuations within a certain period of time W is below a 
certain threshold, (j), and another one when the local standard deviation outnumbers (j). In the 
former situation we use standard rules for heteroscedastic processes whereas in the latter case 
the system starts recalling past values that surpassed the threshold. Our results show that 
for appropriate parameter values the model is able to provide fat tailed probability density 
functions and strong persistence of the instantaneous variance characterised by large values of 
the Hurst exponent {H > 0.8), which are ubiquitous features in complex systems. 
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I. Introduction 

For the last years the physical community has broaden its subject goals to matters 
that some decades ago were too distant from the classical topics of Physics. Despite 
being apparently at odds with the standard motivations of Physics, this new trend 
has given an invaluable contribution toward a more connected way of making Science, 
thus leading to a better understanding of the world surrounding us [1]. Within this 
context, the major contribution of physicists is perhaps the quantitative procedure, 
reminiscent of experimental physics, in which a model is proposed after a series of studies 
that pave the way to a reliable theory. This path has resulted in a series of findings 
which have helped such diverse fields as physiology, sociology and economics, among 
many others [21 [3l H]. Along these findings, one can mention the determination of non- 
Gaussian distributions and long-lasting (power-law like) correlations [5l[6l[7]. Actually, 
by changing the observable, the conjunction of the two previous empirical verifications is 
quite omnipresent. For this reason and regardless the realm of the problem very similar 
models have been applied with particular notoriety to discrete stochastic processes of 
time-dependent variance based on autoregressive conditional heteroscedastic models [8] ■ 
That is to say, most of these models are devised taking basically into account the general 
features one aims at reproducing, rather than putting in elements that represent the 
idiosyncracies of the system one is surveying. For instance, many of the proposals cast 
aside the cognitive essence prevailing on many of these systems, when it is well known 
that in real situations this represents a key element of the process [9]. On the other 
hand, intending to describe long-lasting correlations, long-lasting memories are usually 
introduced thus neglecting the fact that we do not traditionally keep in mind every 
happening. As a simple example, we are skilled at remembering quotidian events for 
some period. However, we will discard that information as time goes by, unless the 
specific deed either created an impact on us or has to do with something that has really 
touched us somehow. In this case, it is likely that the fact will be remembered forever 
and called back in similar or related conditions, which many times lead to a collective 
memory effect |1U] . 

In this work, we make use of the celebrated heteroscedastic model, the ARCH 
process [11] and modify it by pitching at accommodating cognitive traits that lead to 
different behavior for periods of high agitation or impact. Particularly, we want to 
stress on the fact that people tend to recall important periods, no matter when they 
took place. To that end, we introduce a measure of the local volatility, as well as a 
volatility threshold, so that the system changes from a normal dynamics, in which it 
uses the previous values of the variable to determine its next value, to a situation in 
which it recalls the past and compares the current state with previous states of high 
volatility, even if this past is far. 
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I.l. Standard models of heteroscedasticity 

The Engle's formulation of an autoregressive conditional heteroscedastic (ARCH) time 
series [TT] represents one of the simplest and effectual models in Economics and Finance, 
for which he was laureated the Nobel Memorial Prize in Economical Sciences in 2003 |12j . 
Explicitly, the ARCH corresponds to a discrete time, t, process associated with a 
variable, z^, 

Zt = (Tt Ut, (1) 

with Ut being an independent and identically distributed random variable with zero mean 
and standard deviation equal to one. The quantity at represents the time-dependent 
standard deviation, which we will henceforth name instantaneous volatility for mere 
historical reasons. Traditionally, a Gaussian is assigned to the random variable tuj, 
but other distributions, namely the truncated a-stable Levy distribution and the q- 
Gaussian (Student-t) have been successfully introduced as well [13], [Hj. In his seminal 
paper, Engle suggested that the values of could be obtained from a linear function 
of past squared values of Zt, 

s 

a^ = a + J2^,zl„ (a,6, >0). (2) 

i=l 

In financial practice, viz., price fiuctuations modelling, the case s = 1 {bi = b) represents 
the very most studied and applied of all the ARCH (s)-like processes. The model has 
been often applied in cases where it is assumed that the variance of the observable 
(or its fluctuation) is a function of the magnitudes of the previous occurrences. In a 
financial perspective, Engle's proposal has been associated with the relation between 
the market activity and the deviations from the normal level of volatility a, and the 
previous price fluctuations making use of the impact function [8]. Alternatively, recent 
studies convey the thesis that leverage can be responsible for the volatility clustering 
and fat tails in flnance [TS]. Nonetheless, the heteroscedastic ARC H-\ike processes has 
been repeatedly used as a forecasting method. In other words, one makes use of the 
magnitude of previous events in order to indicate (or at least to bound) the upcoming 
event (see e.g. [T6| IT7]). In respect of its statistical features, although the time series 
is completely uncorrelated, {zt Zt') ~ Stt', it can be easily verifled that the covariance 
{\zt\ \zt'\) is not proportional to 6tt'- As a matter of fact, for s = 1, it is provable that 
{zf z^,) decays according to an exponential law with a characteristic time r = |ln6|~^. 
This dependence does not reproduce most of the empirical evidences, particularly those 
bearing on price fluctuations studies. In addition, the introduction of a large value of s 
used to give rise to implementation problems [TS]. Expressly, large values of s augment 
the difliculty of flnding the appropriate set of parameters {bi} for the problem under 
study as it corresponds to the evaluation of a large number of fltting parameters. Aiming 
to solve this short-coming of the original ARCH (1) process, the GARCH [s, r) process 
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was introduced [19] (where G stands for generalised), with Eq. ([2]) being replaced by, 

s r 

- + ^ &i zl_i + ^ Ci (7l_i (a, hi, Ci > 0) . (3) 



i=l i=l 

In spite of the fact that the condition, h + c < 1, guarantees that the GARCH {1, 1) 
process exactly corresponds to an infinite-order ARCH process, an exponential decay 
for {zf z^i), with r = |ln {h + c)|~^ is found. 

Although the instantaneous volatility is time dependent, the ARCH{\) process is 
actually stationary with the stationary variance given by, 

{^') = = < 1)' (4) 

(herein (. . .) represents averages over samples at a specified time and denotes averages 
over time in a single sample). Moreover, it presents a stationary probability density 
function (PDF), P{z), with a kurtosis larger than the kurtosis of distribution P{uj)- 
Namely, the fourth-order moment is, 

= <^'> (l-6)(lt_V(a4)) - 

This kurtosis excess is precisely the outcome of the dependence of at on the time (through 
z). Correspondingly, when 6 = 0, the process is reduced to generating a signal with the 
same PDF of u, but with a standard variation equal to ^/a. At this point, it is convenient 
to say that, for the time being and despite several efforts, there are only analytical 
expressions describing the tail behaviour of P{z) or the continuous-time approximation 
of the ARCH{1) process with the full analytical formula still unknown [HI [2(7] . 

In order to cope with the long-lasting correlations and other features such as the 
asymmetry of the distribution and the leverage effect, different versions of the ARCH 
process have been proposed [SI HB]. To the best of our knowledge, every of them 
solve the issue of the long-lasting correlations of the volatility by way of introducing 
an eternal dependence on zf in Eq. hi = hK, (i), with /C (.) representing a slowly 
decaying function [211 [22]. Most of these generalisations can be encompassed within the 
fractionally integrated class of ARCH processes, the F I ARCH [2S1 [211 [2S] • The idea 
supporting the introduction of a power- law for the functional form of (.) is generally 
based on the assumption that the agents in the market make use of exponential functions 
/C (.) with a broad distribution of relaxation times related to different investment 
horizons [26] [27]. This type of model has achieved a huge popularity in the replication 
of non-Gaussian time series in several areas, such as biomedicine, climate, engineering, 
and physics (a few examples can be found in [281 ESI [301 EB ESI [33]). 

As described above, the statistical features of the macroscopic observables are the 
result of the nature of the interactions between the microscopic elements of the system 
and the relation between microscopic as well as the macroscopic observables. In the case 
of the "financial" ARCH process, it was held that zf bears upon the impact of the price 
fluctuations on the trading activity. On the one hand, it is understood that the impact 
of the price fluctuations (or trading activity) on the volatility does not merely come from 
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recent price fluctuations and it does actually involve past price fluctuations. In finance, 
upgraded versions of heteroscedasticity models use multi-scaling, i.e., it is assumed 
that the price will evolve by modulating the volatility according to the volatility over 
different scales (days, weeks, months, years, etc.) [31] in order to smooth their possible 
misjudgement about the volatility. However, in practice, these models do not differ much 
from FIARCH-like proposals at the level of the results we are pointing at. Alternately, 
it is worthwhile to look upon the ARCH proposal as a mechanism of forecast [121 IT?] . 
In this way, the simplest approach, the ARCH{1), represents an attempt to foresee 
future values just taking into account recent observations, whereas models like the 
F I ARCH bear in mind all the history weighting each past- value according to some 
kernel functional. 



1.2. Minding impacting events 

In our case, we want to emphasise the fact that people tend to recall periods of high 
volatility (i.e., impact) in the system, no matter when they took place, by changing the 
surrounding conditions as agent-based models suggested [351 [36] • Hence, we introduce 
a measure of the local volatility, 

^ w-i 

^* = Ti7 E (5) 

i=0 

and a threshold, 0, so that instead of Eq. (I2|), the updating of goes as follows: 

t 

(6) 

t 

K i=l 

where hi = hlC{i) = h exp [— i/r] [2Z]. Therefore, if we assume the financial market 
perspective, we are implicitly presuming that the characteristic time, r, is Dirac delta or 
at least narrow distributed, so that the exponential functional is a valid approximation. 
This approach is confirmed by recent heuristic studies in which it has been verified that 
the largest stake of the market capitalisation is managed by a small number of companies 
that apply very similar strategies [3H]- With the second branch equation we intend to 
highlight the difference in behaviour of the "normal" periods of trading and the periods 
of significant volatility, in which the future depends on the spells of significant volatility 
in the past as well. The values h[ are defined as, 

h[ = hp.,Q[vt^,-<P], (7) 

with ©[...] being the Heaviside function and Pi is a factor that represents a measure of 
the similarity (in the volatility space) between the windows of size W with upper limits 
at zt and Zt-w+i^ respectively. Analytically, this is equivalent to mapping segments in 
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the form \^zf, . . . , zf_y^_^^^ into vectors in M^J"^ and afterward computing a normalised 
internal product-like weight, 

1 ^ 

P^ = jfY.''h''h^ (8) 

i=i 

where, for the sake of simplicity, we set aside the time dependence of pi and h'^ in the 
equations, while M represents the normalisation factor such that 'YliiPi = 1 fo^^ ^ ^ 
(with fixed t). 

We are therefore dealing with a model characterised by 5 parameters, namely: a (the 
normal level of volatility) and h (the impact of the observable in the volatility), which 
were both first introduced by Engle in [11]; r, put forward in exponential models [37] : 
and two new parameters W (representing the volatility spell) and that we will reduce 
to a single extra parameter. If we think of trading activities, our proposal introduces a 
key parameter, the volatility threshold, 0, which signals a change in behaviour of the 
agents in the market. At present, significant stake of the trading in financial markets 
is dominated by short-term positions and thus a good part of the dynamics of price 
fluctuations can be described by Eq. ([2]), or by functions with an exponential kernel. As 
soon as the market fluctuates excessively, i.e., the volatility soars beyond the threshold, 
the market changes its trading dynamics. The main forecast references are obviously 
the periods where the volatility has reached high levels and afterward, the periods of 
those which are most similar; this is the rationale described by our Eq. ([8]). Thence, 
our proposal is nothing but the use of simple mechanisms that in a coarse-grained way 
master a good part of our decisions. 



II. Results 

II. 1. General results 

In this section we present the results obtained by the numerical implementation of the 
model. For comparison, we will use the results of a prior model that can be enclosed in 
the class of F I ARCH processes [25]. There, the adjustment of the parameters comes 
from the delicate balance between the parameter 6, which is responsible for introducing 
deviations of the volatility from its normal level a, and the parameter controlling the 
memory. On the one hand, large memory has the inconvenient effect of turning constant 
the instantaneous volatility, so that after a seemly number of time steps the value of 
cr becomes constant, hence leading to a Gaussian (or close to it) distribution of the 
variable z, independently of how large h is. On the other hand, short memory is unable 
to introduce long-range correlations in the volatility, although it enhances larger values 
of kurtosis excess. The model we introduce herein is rather more complex. In order 
to deal with the change of regime, we define a parameter establishing this alteration 
and we need to specify W and r. Henceforth, we have assumed W = t, which is very 
reasonable as it imposes that the volatility and the time scale that the agents in the 
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market use to assess the evolution of the observable are the same. In order to speed up 
our numerical implementation, we have imposed a cut-off of lOW in the computation 
of the first line in Eq. ([6]). This approximation turns the numerical procedure much 
lighter with a negligible effect because the influence of the discarded past is not much 
relevant in numerical terms (within standard numerical implementation error). In all 
of our realisations, we have used a normalised level of expected volatility, a = 1, and 
we have defined the volatility threshold in units of a/ (1 — 6), following a stationary 
approach, as well. 

We have adjusted the probability distributions of z by means of the distribution, 

P{z) = Z-'{l + Bz^'')^ , (9) 

the behaviour of which follows a power-law distribution for large 1^1 with an exponent 
equal to -tzj and where (using Ref. [39], sec. 3.194), 



i + (-i)"r[^]r 


'2u+{n+l)(l-g')' 
2u{q'-l) 


1 + ^ E'^rr 


1 





z''{l + Bz"')^dz = —^ — 4 4- (10) 



(^■^Zi > I + , and Z represents the previous integral with n = 0. The fittings for the 
probability density distribution ([9]) were obtained using non-linear and maximum log- 
likelihood numerical procedures and the tail exponents double-checked with the value 
given by the Hill estimator [30]. As a matter of fact, values of u different from 1 have 
only been perceived for large values of b and small values of (p (slightly larger) or large 
values of (slightly smaller). For u = 1 and q' ^ 1, the PDF corresponds to a q'- 
Gaussian distribution (or Student-t distribution) [41] and when g' = 1 we have either 
the Gaussian {u = 1) or the stretched distribution {u ^ 1). Since that in the majority 
of the applications one is interested in the tail behaviour, we have opted for following 
the same approach by defining the tail index as, 

7 = ^ q = u-^{q' + u-l). 11 

q — I g — 1 

In spite of the fact that other functional forms could have been used, we have decided 
on Eq. because of its statistical relevance and simplicity (in comparison with 
other candidates involving special functions, namely the hypergeometric) . Moreover, 
the g-Gaussian (t-Student) is intimately associated with the long-term distribution of 
heteroscedastic variables since it results in the exact distribution when the volatility 
follows an inverse-Gamma distribution [531 1321113] • 

Concerning the persistence of the volatility, we have settled on the Detrended 
Fluctuation Analysis (DFA) [33], which describes the scaling of a fluctuation function 
related to the average aggregated variance over segments of a time series of size i, 

F{i)^i^, (12) 

where H is the Hurst exponent. Although it has been shown that Fluctuation Analysis 
methods can introduce meaningful errors in the Levy regime [35] , we have verified that 
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Table 1: Critical values = ^ — otcrit from the Kolmogorov-Smirnov test for typical 
pairs (W, </>) used for adjustments. 

for our case, which stands within the finite second-order moment domain, the results of 
DFA are so rehable as other scahng methods. 

Let us now present our results for h = 0.5, which is able to depict the qualitative 
behavior of the model for small b. This case corresponds to a situation of little deviation 
from the Gaussian, when long-range memory is considered. In accordance, we can 
analyse the infiuence of the threshold and W. Overall, we verify a very sparse 
deviation from the Gaussian. Keeping W fixed and varying 0, we understand that 
for small values of (p the distribution of zt is Gaussian and the Hurst exponent of \zt\ 
is 1/2. It is not hard to grasp this observation if we take into account that, by using 
small values of 0, we are basically employing almost all of the past values which limits 
the values of instantaneous volatility to a constant value after a transient time. As 
we increase the value of 0, we let the dynamics be more fiexible and therefore the 
volatility is able to fiuctuate, resulting in a kurtosis excess. For small values of W, 
the Hurst exponent is slenderly different from 1/2 and the value of the Hurst exponent 
increases with W. However, because of the small value of b, the rise of W turns out the 
distribution of z barely undistinguishable from a Gaussian. This behaviour is described 
in Fig. [TJ We have obtained a Gaussian distribution and a Hurst exponent H = 0.5 
for small values of (0 = 0.1) and W [W = 5). When we augment the value of 
the threshold, = 5, the system is loose and the instantaneous volatility is able to 
fluctuate leading to the emergence of tails (g = 1.09) and a subtle increase of the Hurst 
exponent {H = 0.52 ± 0.01). Hiking up both W and {W = 75 and = 2), we have 
achieved large values of the Hurst exponent [H = 0.58 ± 0.02), but the small value of b 
is not sufficient to induce relevant fiuctuations, bringing on a distribution that is almost 
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Gaussian (g = 1.02). The distribution fittings were assessed by computing the critical 
value = 1 — acrit from the Kolmogorov-Smirnov test [IHj that are equal to 0.9634 
and 0.9454, respectively. 




Figure 1: Left column; Probability density functions P{z) vs z in a log-linear scale; 
Right column: Fluctuation function F{£) vs i for \z\ in a log-log scale. The values of 
the model parameters are: (j) = 0-lj W = 5 yielding q = I and H = 0.5 ± 0.01 (upper 
panels); cj) = 5,W = 5 yielding q = 1.09 ± 0.01 and H = 0.52 ± 0.01 (middle panels); 
(j) = 2,W = 75 yielding q = 1.02±0.01 and H = 0.58±0.02 (lower panels). The results 
have been obtained from series of 4 x 10^ elements and the numerical adjustment of 
P(z) gave values of x^/n never greater than 0.00003, with R never smaller than 0.998. 

As we increase the value of b, we favour the contribution of the past values of the 
price dynamics, thus, for the same value of W we are capable of achieving larger values 
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of the kurtosis excess, that we represent by means of the increase of the q index. The 
same occurs for the Hurst exponent. This general scenery is illustrated in Fig. [2] for 
the value b = 0.998, where we present the dependence of q and H with 0, for different 
choices of W. Again, the higher W, the lower the tail index q, because the extension of 
the memory surges a weakening of the fluctuations in the volatility. The opposite occurs 
with the Hurst exponent, which increases towards unit (ballistic regime) as we consider 
W larger, for obvious reasons. In all the cases of (6, W) investigated, we verified that 
both q and H augment with 0. The assessment of the numerical adjustments is provided 
in Tab. [T]in the form of the P^^-^ critical values from the Kolmogorov-Smirnov test 
The only case we obtained a value 1 (within a five-digit precision) was for the pair 
= 10 and = 5, which results in a value quite close to the limit of finite second-order 
moment (a fat-tailed distribution with g = 5/3). At this point it is worth saying that 
we have investigated the likelihood of other well-known continuous distributions, such 
as the stretched-exponential, the simple t-Student, Levy, and Gaussian. Nonetheless, 
the fittings carried with Eq. (|9]) outperformed every other analysed distribution. 

Concerning the instantaneous volatility, at, we verified that the Dirac delta 
distribution, p (a) = 6 {a — 1), starts misshaping and short tails appear as we depict in 
Fig. m (upper panel) for the case b = 0.998, W = 75 and = 0.25. Considering this 
particular case, we can present relevant evidence of the effectiveness of our proposed 
probability distribution approach. The empirical distribution function in the upper 
panel of Fig. H] may be simply approximated by 

fj-c if ^ ^ 1 

P[^)='{ , (13) 

[1 — f) 6 {a — 1) otherwise 

with c > 0, / < 1, and a G (1 — c, 1 -|- c); when / = we recover the homoscedastic 
process distribution as a particular case. Reminding that at each time step the 
distribution is a Gaussian (conditioned to a time-dependent value of a) the long-term 
distribution is, 

"l + C 1 



P{z) = 

which gives (Ref. [39 
P{z) = 



p[a) 

sec. 3.351) 
/ 



/2 na 



exp 



2a2 



da. 



(14) 



4V2 7rc 



Ei 



Ei 



2(1 + c) 



+ 



/ 



/2 7r 



exp 



2(l-c) 

where Ei [.] is the Exponential Integral function (see e.g. Ref. |1H])- Considering c = 1/2 
(which is appropriate to the case shown) and taking for the sake of simplicity / = 1/2, 
we obtain the function presented in Fig. [^, the kurtosis of which is k, = ~ 3.47 



3125 



(making use of Ref. [39], sec. 5.221). The accordance between this distribution and the 
empirical distribution is quite remarkable since it emerges from no numerical adjustment 

I Actually, this curve is represented in the scaled variable z/cr so that the standard deviation, which 



is originally equal to 



{c+iy+{c-iy+6c 

12c 



, becomes equal to one, like in other depicted distributions. 
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Figure 2: Upper panel: Value of the tail index q vs parameter for several values 
of W and h = 0.998 according to the adjustment procedures mentioned in the text. 
Lower panel: Hurst exponent H vs 4>. The results have been obtained from series 
of 4 X 10^ elements and the numerical adjustment of P{z) gave values of jn never 
greater than 0.00003 with never smaller than 0.9998. Regarding the values of the 
Hurst exponent, the absolute error has never been greater that 0.015 and a linear 
coefficient R > 0.999. 



and can be further improved by tuning the values of / and c. Regardless, this kurtosis 
value is only 2.2% larger than our numerical adjustment (see Table [1] for the goodness 
of fitting). Furthermore, comparing the distributions by means of the symmetrised 
Kullback-Leibler divergence KL = I (^J P{z) In p^^dz + JP' (z) In ^dz^ , we obtain 
a value of 0.00014 that is 19 times smaller than the distance between our fitting and 
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a Gaussian. These results show that the PDF of Eq. ([9]) not only provides a good 
description of the data, but it is much more manageable as well. 




Figure 3: The points represent the empirical distribution function for b = 0.998, 
(j) = 0.25 and W ~ 75; the dashed red hnc is our adjustment with Eq. © with 
q=l.l± 0.01, V = I and B ^ {q- l)/(5 - 3g) [x^/n = 0.00003 and = 0.9986]; 
the green hne is PDF with / = c = 1/2 and the dotted cyan hne is the Normal 
distribution. 



Cases for which the kurtosis excess is relevant (g > 5/4) stem from wider 
distributions of a (see the lower panel of Fig. H]). Actually, it is the emergence of larger 
values of the instantaneous volatility that brings forth fat tails. Although we have not 
been successful in describing the whole distribution, we have verified that, for values of 
q > 5/4, the distribution p (a) is very well described by a type-2 Gumbel distribution, 

p (a) a exp [-(3 a'^] a-'^-\ (16) 

and after certain value of a the distribution sharply decreases according to a power-law 
with a large exponent. We credit this sheer fall to the threshold 0, which introduces a 
sharp change in the dynamical regime of the volatility and thus in its statistics. 

In finance, such a cut-off is more than plausible as real markets do suspend trading 
when large price fluctuations occur. This also grants feasibility to descriptions based 
on truncated power-law distributions [6]. Moreover, a fall off is also presented in the 
quantity Ue of Fig. 3 in Ref. [H]. It is known that for heteroscedastic models the tail 
behavior of the long-term distribution is governed by the asymptotic limit of p{a) when 
a tends to infinity. For the case of distribution (I16p . this limit is the power- law cr"^~^ 
and therefore we can verify that the asymptotic behaviour of the long-term distribution 
of the variable z, 



lim P (z) 

|2|— >-00 



lim p (a) 

.a—^cx 



exp 



2(t2 



da 



[17) 




Figure 4: Probability density function of the instantaneous volatility p{a) vs a for two 
different cases with b — 0.9998. Upper panel: (j) = 0.25 and W = 75 which leads to a 
sharply peaked distribution around cr = 1 and to a P{z) tail index q ~ 1.1. Lower panel: 
<j) = 2.5 and W = 25 that results in a broader distribution largely described by a type-2 
Gumbel distribution with /? = 0.421 ± 0.002 and C = 2.323 ± 0.006 [x^/n = 0.00011 
and B? = 0.9982). For cr ^ 5, p{(j) changes its behavior to a faster decay with an 
exponent equal to 8.4 ± 0.2 represented by the gray symbols. The ANOVA test of the 
type-2 Gumbel adjustment (up to cr '--^ 5) have yielded a sum of squares of 0.03553 (323 
degrees of freedom) and 20.3684 (2 degrees of freedom) for the error and the model, 
respectively. The uncorrected value of the sum of squares is 20.4039 (325 degrees of 
freedom) and the corrected total is 12.5941 (324 degrees of freedom). The empirical 
distribution function has been obtained from series of 4 x 10^ elements. 



da. 
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yields a power-law distribution (applying Ref. [39], sec. 3.326), 

P{z)^\z\-'^, (z^oo). (18) 

For p{a) following an exponential decay in the form exp[— 70"], a similar procedure 
yields. 



P(z)^G 



0, 1, 1 



[z-^00), (19) 



where G [.] is the Meijer G-function [39], HH] 

It is worth saying that we can reduce the number of parameters to a, b and </>, i.e., 
apply the simple ARCH{1) process, and obtain fat tails and persistence still. 

II. 2. Comparison with a real system 

Following this picture, we can now look for a set of parameters that enable us to replicate 
a historic series such as the daily (adjustec^JJ]) log-index fluctuations, {r (t)}, of the SP500 
stock index, {S (t)}, between 3rd January 1950 and 12th April 2010 (14380 data points) 
with, 

r{t) = lnS{t + l)-lnS{t). (20) 

Inspecting over a grid of values of b, W and 0, we have noted that the values of 0.9998, 
22 and 1.125, respectively, yield values of q and H for {zt} that are in good agreement 
with a prior analysis of {r (t)} which gave q = 1.48 ± 0.02 (using a simple t-Student 
distribution) and q = 1.51 ± 0.02 (g' = 1.47 ± 0.003, u = 0.92 ± 0.008) [x^/n = 0.00003, 
r2 = 0.999 and F^g = 0.9276] (using the PDF of Eq. (|9])) and persistence exponent 
H = 0.86 ± 0.03 (see Fig. 4). Comparing the numerical distribution of our model with 
the data we obtained Dks = 0.014 and a P^s critical value equal to 0.991 from the two- 
sample Kolmogorov-Smirnov test [l6], while the comparison between the distribution 
of the numerical procedure and the adjustment of the SP500 empirical distribution 
function yielded P^^ = 0.9998. Once again we have tested other possible numerical 
adjustments and the only other relevant distribution was the stretched exponential with 
z/ = 1.3 ± 0.02 (g' = 1) which has given a P^g different from 1 (P;;^^ = 0.9999), but a 
significantly larger value of [x^/n = 0.00009, = 0.9963]. 

It is worthy to be mentioned that all the three values of the parameters are plausible. 
First, within an application context, b is traditionally a value robustly greater than 0.9. 
Second, W is close to the number of business days in a month and last, but not least, 
(j) is somewhat above the average level of the mean variance presented above. This 
provides us with a very interesting picture of the dynamics. Specifically, at a relevant 

§ In an effort to obtain a full description of p [a) wc also used a function such as f (x) = 

Zcxp [— /3a;~^] (^1 ~ 5 + 5 cxp ^ which allows the appearance of a crossover from a power 
law to an exponential decay. Nonetheless, it did not provide better results. 

II The adjusted values of the index take into account dividend payments and splits occurred in a 
particular day. 
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approximation we can describe this particular system as monitoring the magnitude of its 
past fluctuations with a characteristic scale of a month, from which it computes the level 
of impact resulting in an excess of volatility. Actually one month moving averages are 
established indicators in quantitative analyses of flnancial markets. When the volatility 
in a period of the same order of magnitude of W surpasses the value 0/ (1 — fe), then the 
system recalls previous periods of time, no matter how long they happened, in which a 
signiflcant level of volatility excess occurred. Those periods are then averaged in order 
to determine the level of instantaneous volatility a^. 




1000 ^ 10000 



Figure 5: Upper panels: On the left side, Probability density function P{z) vs z for 
b = 0.998, W = 22 and cj) = 1.125 (full line) [q = 1.49 ±0.01 with x'^/n = 0.000025 and 
i?2 = 0.9984] and the S'P500 daily log-index fluctuations (symbols) [q = 1.48 ± 0.02 
with x^/" = 0.00004 and = 0.996] in the log-linear scale and on the right side 
the complementary cumulative distribution function D{x > z) vs z for case shown on 
the left. Lower panel: Fluctuation function F{£) vs i for the same parameters above 
[H = 0.85 ± 0.02, with R = 0.998] (red circles) and for the S'P500 daily log-index 
fluctuations [H = 0.86 ± 0.03, with R ~ 0.997] (black squares) in a log-log scale. 



III. Discussion 

We have studied a generalisation of the well-known ARCH process born in a flnancial 
context. Our proposal differs from other generalisations, since it adds to heteroscedastic 
dynamics the ability to reproduce systems where cognitive traits exist or systems 
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showing typical cut-off limiting values. In the former case, when present circumstances 
are close to extreme and impacting events, the dynamics switches to the memory of 
abnormal events. By poring over the set of parameters of the problem, namely the 
impact of past values, b, the memory scale, W, and the volatility threshold, 0, we have 
verified that we are able to obtain times series showing fat tails for the probability density 
function and strong persistence for the magnitudes of the stochastic variable (directly 
related to the instantaneous volatility), as it happens in several processes studied within 
the context of complexity. In order to describe the usefulness of our model we have 
applied it to mimic the fluctuations of the stock index SP500, we verified that the best 
values reproducing the features of its time series are W close to one business month and 
(j) greater that the mean variance of the process which is much larger than the normal 
level of volatility for which trading is not taken into account. Concerning the volatility, 
we have noticed that for the problems of interest {i.e., fat tails and strong persistence), 
the distributions are very well described by a type-2 Gumbel distribution in large part 
of the domain, which explains the emergence of the tails. 
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